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CONSTRUED  AS  AN  OFFICIAL  DEPARTMENT  OF  THE 
ARMY  POSITION,  UNLESS  SO  DESIGNATED  BY  OTHER 
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Abstract 


There  are  k (>^  2)  competing  normal  populations  with  common  known 
variance  and  unknown  means  **  ’®k*  Let  ®[1]  — ***—  0[k]  denote 

the  ordered  values  of  the  {6^}.  Nothing  i3  known  concerning  the 
pairing  of  the  {(h)  and  the  . In  the  location  invariant 

identification  problem,  the  differences  t0[j]  “ 6[1]^  are  given  and 

it  is  desired  to  select  the  population  associated  with  Of 

particular  interest  is  the  slippage  configuration  0n]  = ®[k-l]  = ®[k]  “ 
where  > 0 is  given.  We  restrirnr  attention^to  procedures  that 
guarantee  that  the  population  with  the  largest  mean  is  correctly  selected 

K. 

with  probability  at  least  P*  where  1c  < P*  < 1 is  preassigned. 

A 

Essentially,  this  requirement  is  satisfied  by  the  stopping  rule  -of 


Bechhofer,  Kiefer  and  Sobel  (Sequential  Identifipatrign~and^ Ranking 
Procedures , Univ.  of  Chicago  Press  1968,  Chap.  independently  of  the 
sampling  rule  and  thus  data  dependent  allocation  rules  can  be  considered. 
Unlike  the  case  k = 2^<5ee  Robbins  and  Sicgmund  (1974)  J.  Am.  Statist. 
Assoc. , 69,  132-3.39) when  k 3,  substantial  savings  in  expected  total 
sample  size  can  be  obtained  when  adaptive  sampling  i3  used  instead  of 
the  equal  allocation  rule  ("^vector  at  a t ime^sampling ) . Several  procedures 
are  proposed  and  investigated  with  respect  to  this  criterion.  Also  their 


performance  with  regard  to  the  alternative  criterion  of  mir.  i/^ing 
expected  number  of  observations  on  the  inferior  populations  is  studied. 
Both  theoretical  and  simulation  results  are  presented.  Comparisons  are 
made  with  the  performance  of  elimination-type  procedures,  such  as  that 
of  Paulson  (1964),  Ann.  Math.  Statist.  , 35,  174-180. 

Finally,  for  the  ranking  problem  where  the  differences  8^^  - 6^ 


•J, 


(2  < I <_  k)  are  now  unknown.  It  is  shown  that,  when  adaptive  sampling 
rules  are  used,  the  slippage  configuration  is  no  longer  necessarily  least 
favorable  for  the  usual  indifference  zone  approach.  This  poses  some 
interesting  problems  for  future  research. 


! 


Adaptive  sampling  procedures  have  been  the  subject  of  considerable, 
theoretical  interest  in  the  methodology  of  sequential  medical  trials, 
where  it  is  desired  to  compare  k (>_  2)  treatments.  A recent  survey 
article  by  Hoel,  Sobel  and  Weiss  (1975)  provides  a comprehensive  overview 
of  the  literature.  Most  authors  have  concentrated  on  the  two-armed 
trial,  k = 2,  but  studies  with  three  or  more  arms  are  important.  Gent 
(1976)  describes  a situation  in  which  three  independent  nationwide  trials 
involving  aspirin  substitutes  were  begun  around  the  same  time.  He  points 
out  that,  instead  of  conducting  three  separate  2-armed  trials  of  drug  vs. 
control,  a single  4-armed  study  would  have  resulted  in  a considerable 
saving  in  the  number  of  control  subjects  needed. 

However,  adaptive  sampling  might  be  advantageous  in  other  areas  of 
application.  In  a quality  control  situation  there  may  be  k batches  of 
items  which  deteriorate  in  time  (e.g.  food,  drugs,  etc.)  and  due  to 
records  being  lost  it  is  not  known  which  batch  is  the  newest  (oldest) 
unless  sampling  is  undertaken.  Alternatively  there  may  be  k apparently 
identical  lots  of  items  but  of  varying  quality  and  it  is  desired  to  pick 
the  superior  one.  Another  application  occurs  when  the  observations  are 
radar  measurements  on  the  nose  cone,  booster  and  other  debris  of  a 
rocket  after  separation  and  it  is  desired  to  identify  the  nose  cone  as 
quickly  as  possible  e.g.  for  guidance  or  for  possible  counter-measures 
if  the  nose  cone  is  a warhead  and  the  other  objects  are  decoys.  There 
have  been  several  papers  on  the  subject  of  how  search  radars  could  be 
improved  by  non-uniform  scan  controlled  by  a sequential  detector  - see 
e.g.  Edrington  and  Petersen  (1971). 
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Consider  a k-ary  communication  channel  with  feedback.  The  receiver 
sequentially  processes  the  signal  corresponding  to  one  of  the  k possible 
characters  until  it  identifies  it  with  some  degree  of  certainty  - the 
receiver  then  directs  the  transmitter  to  send  the  next  character.  In 
this  application  (and  possibly  others),  there  is  a single  input  process 
with  k hypotheses.  If  so,  the  k populations  may  be  taken  as  the 

output  processes  of  k filters.  If  the  k possible  signals  are 
orthogonal  and  the  noise  Gaussian,  then  matched  filters  would  accomplish 
the  desired  transformation.  There  are  further  applications  to  unsupervised 
learning  and  classification  techniques  — see  Nagy  and  Tolaba  (1972). 

An  adaptive  sequential  procedure  consists  of  three  parts  — a 
sampling  rule  (also  called  assignment  or  allocation  algorithm)  for 
deciding  from  which  population  the  next  observation  is  to  be  taken,  a 
stopping  rule  for  deciding  when  sampling  should  cease,  and  a terminal 
decision  rule  for  selecting  a population  as  '‘best".  Robbins  and  Siegmund 
(1974)  treat  the  problem  of  deciding  which  of  two  normally  distributed 
treatments  with  a common  known  variance  has  the  larger  mean  response.  They 
show  how  to  construct  a location  invariant  stopping  rule  and  terminal 
decision  rule  of  sequential  probability  ratio  test  type  (SPRT)  so  that 
the  error  probabilities  are  essentially  independent  of  the  (symmetric) 
sampling  rule  used.  In  particular  the  SPRT  can  be  constructed  to 
guarantee  that  the  probability  (PCS)  of  correctly  selecting  the  treatment 
with  the  larger  mean  is  at  least  P*  wherever  the  difference  of  the  means 
is  no  less  than  6*.  Here  6*  > 0 and  1/2  < P*  < 1 are  preassigned. 

They  then  consider  various  sampling  rules  with  respect  to  two  measures  of 
performance,  namely  (A)  expected  total  number  of  observations  (ASN)  and 
(B)  expected  total  number  of  observations  on  the  inferior  population  (ITN). 
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They  showed  that  the  ASN  was  minimized  by  pairwise  (or  vector-at- 
a-time  VT)  sampling.  They  also  show  how  adaptive  sampling  rules  which 
reduce  the  ITN  necessarily  cause  a considerable  increase  in  ASN.  (As 
we  shall  demonstrate  later,  this  result  is  in  sharp  contrast  with  the  case 
k ^ 3 where  the  two  goals  (A),  (B)  are  no  longer  in  direct  conflict.) 
Subsequently  Louis  (1975)  obtained  the  optimal  allocation  rule  for 
minimizing  ASN  + y*  ITN  (for  y > 0)  in  the  continuous  time  analogue  of 
this  k » 2 problem. 

In  this  paper,  we  attempt  to  attack  the  k (>_  3)  treatment  problem 
in  the  same  spirit  as  Robbins  and  Siegmund.  Bechhofer,  Kiefer  and 
Sobel  (1968)  [BKS]  treat  in  great  detail  vector-at-a-time  (VT)  sampling  for 
selecting  the  best  of  k populations  for  a stopping  rule  which  is  a 
generalization  of  the  SPRT.  However,  when  k >_  3,  great  savings  in  ASN 
can  be  made  if  the  sampling  rule  is  allowed  to  be  adaptive  (i.e.  data- 
dependent).  As  mentioned  before,  such  rules  can  have  the  side  benefit  of 
producing  a lower  expected  total  number  of  observations  on  the  k - 1 
inferior  treatments  (ITN).  This  is  intuitively  clear;  for  consider  the 
normal  case  with  k = 3 and  when  one  population  mean  is  much  smaller  than 
the  others.  Very  early  on  in  experimentation,  this  population  will  become 
apparent  and  only  two  populations  will  be  left  in  contention.  Thus, 
compared  to  VT  sampling,  we  might  expect  that  the  ASN  can  be  reduced  by 

almost  half;  Correspondingly  larger  savings  could  occur  with  larger  values 
of  k.  In  fact,  although  previous  authors  have  treated  small  values 
of  k (usually  k = 2),  there  are  applications  where  k may  be  as 
large  as  1000.  That  ASN  can  be  reduced  by  adaptive  sampling  is  important 
since  recently  Byar  et  al.  (1976)  have  suggested  that  it  is  ASN  and 
not  ITN  which  should  be  the  more  important  criterion  for  clinical  trials. 
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In  the  next  section,  we  describe  a k-population  location  invariant 
(LI)  identification  problem  and  propose  a stopping/terminal  decision  rule 
with  the  property  that  its  PCS  is  independent  of  the  Sampling  rule  used.  In 
Section  3,  we  describe  some  adaptive  allocation  rules , and  also,  in  Section 
4,  the  elimination- type  procedures  of  Paulson(1964)  which  can  also  be  viewed 
as  an  adaptive  rules.  In  Section  5,  these  rules  are  compared  via  Monte 
Carlo  simulation.  In  Section  6,  we  investigate  the  ranking  problem  associated 
with  the  LI  identification  problem  of  Section  2.  However,  unlike  the  case 
k = 2,  or  the  case  of  k _>  2 with  VT  sampling,  we  find  that  the  usual 
slippage  configuration  is  no  longer  "least  favorable"  and  thus  the  results 
of  Section  2 cannot  be  extended.  This  is  a somewhat  surprising  result 
(at  least  to  the  authors).  All  the  above  ideas  will  be  made  more  precise 
i ;ctions  that  follow. 
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2.  The  location  invariant  (LI)  identification  problem 

Observations  (1<  i <k,  j 1)  are  available  sequentially 

from  each  of  k (>^  2)  populations  n2 » * • * *11^ * The  observations 

{Xij}  are  independent  with  probability  density  function  f(x  - 6^) 
where  0.  is  a real  location  parameter.  We  let  0,,-,  < <0n.  , < 0r 


’[1]  1 


-w[k-l]  - °[k] 


denote  the  ranked  values  of  the  {0^}.  We  assume  that  nothing  is  known 
a priori  concerning  the  pairings  of  the  {0|-^j}  and  the  Also  we 


define  t.  = 0.  - 0r,n  and  let  xr.n 
i i [lJ  [i] 


0r.n  - 0r,n  be  the  ranked 
Li]  LI] 


values  of  the  {x^}.  If  the  values  of  the  are  known  ( unknown ) then  we 

say  we  have  an  identification  (ranking)  problem.  Here  we  shall  consider 

the  location  invariant  (LI)  identification  problem  where  only  the  differences 

{tj-^-j}  are  known.  We  shall  discuss  the  construction  of  adaptive  sequential 

procedures  which  guarantee  that  the  probability  of  correctly  selecting 

(PCS)  that  population  associated  with  T[)<]  is  at  least  P*,  where 

k ^ < P*  < 1 is  prespecified  (we  assume  Xj-^j  Is  unique  i.e. 

x^^  > T[k_1])*  As  in  Robbins  and  Siegmund  (1974),  by  invariance  we  can 

restrict  consideration  to  procedures  based  on  the  maximal  invariants 

Y..  = X . - X , 1 i < k,  j >_  1.  Suppose  that  we  have  taken  n. 

^■3  ^3  ^ 1 

observations  from  n.  and  define  N = J n.. 

1 1 1 

The  likelihood  function  based  on  the  (Y . . > is  given  by 

« k ni  ^ 

l*(t. ,x« > • • . >x. ) = j n n f(y*.  ~ x.  + x,  + 

1 z K i=i  j=i  11 


z)dz 


where  of  course  y = 0. 

Define  L(o)  = L*(T[ai]»T[a2]’' ' * ’T[ak])  Where  a e Sk’  the  grOUP  °f 
permutations  of  k elements.  Let  S^_^(i,j)  denote  the  subgroup  of 
permutations  a e S^  for  which  ai  = j . Define 
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I L(a) 
acSkl(i,k) 

l L(a) 
aeSk 


which  is  the  likelihood  that  x.  = x^-j  and  n.  is  the  ’’best" 
population.  (Q^  is  also  the  a posteriori  probability  of  this  event, 
assuming  the  {Ih}  are  all  equally  likely  to  be  the  best,  a priori.) 
Finally,  define 


Q * i5!5k  V 


The  BKS  stopping  rule  (and  terminal  decision  rule)  is  as  follows: 


terminate  sampling  the  first  time  that  Q > P* 
and  select  that  population  associated  with  the 
largest  value  of  Q.." 


(P  ) 

BKS 


Theorem.  If  P, 


BKS 


is  used  for  the  LI  identification  problem  above 


then  P[CS]  P*  for  any  symmetric  sampling  rule  such  that  the  procedure 
terminates  almost  surely. 

Proof.  This  is  essentially  contained  in  BKS  (Chap.  3). 

Remark  1.  Scale  parameters.  The  case  where  f(x|0^)  = f(x/0^)  is 

analogous  and  can  be  handled  either  by  a log  transformation  or  by 

considering  quotients  instead  of  differences. 

For  the  remainder  of  this  paper  we  will  treat  the  example  in  which  the 

2 

observations  {X..}  are  normal  with  means  {0.}  with  known  variance  o . 

i 

For  k = 2,  this  setup  has  been  recently  treated  by  Robbins  and  Siegmund 


(1974)  and  Louis  (1975).  Here  it  is  easy  to  show  that 

1 k ni 

L*(Tl»T2,...,Tk)  « exp [ l l ( 


k ni  _ 2 

2 u t ,x  - x - x + x)  ] 
2o  i=l  j=l  13  “ 


7 


r 


i n. 
k a 


_ i « * r. 

where  x = N 7 7 x . . and  t = N 7 n . x . . Note  that  x and 

* * ij  £_2  11 

r are  not  invariant  under  permutations  of  the  subscripts. 

We  specialize  to  the  6*-slippage  configuration  (Karlin  and  Truax, 
1960)  for  which  x^^  = 0 (1  <_  i <_  k - 1)  and  = 6*  > While  of 

interest  in  its  own  right  (e.g.  see  some  of  the  examples  in  Section  1), 
this  configuration  also  is  the  "least  favorable"  in  the  associated  ranking 
problem  with  VT  sampling  - see  Section  6.  Substituting  for  T[i]>  L(a) 
simplifies  and  for  a e S]<__1(i,k),  we  have 


. n . 

k l 


L(a)  = exp [7  7 (x  ~ x )2  - 26*z.] 

« * i-_i  j _i  hj  • • 1 


2o  h=l  j=l 


where 


z = n. (x.  - x ) 

X 1 1#  • • 


n.  (N-n. ) .,8 

6 


2N 


n. 


and  x 


i = n"1  I x... 

j=l  13 


Let  zq]  < • • • < z[)<]  denote  the  ranked  values  of  the  {z^}.  (Ties 


occur  with  probability  zero.) 
Then 

k-1 


1-Q 


Q = i^1  exP[_6,*'(2[k]  " z[i])/0  3 


and  PBKS  is  equivalent  to: 


(2.1)  "Stop  the  first  time  that  (1-  Q)/Q  (1  - P*)/P*  and  select 


that  population  associated  with  z 


[k]‘ 


1 


It  should  be  noted  that  for  non  VT  sampling  the  population  associated  with 
z^-j  is  not  necessarily  the  same  as  the  one  associated  with  }.<i<k  (x^  )• 


We  now  address  the  question  of  sufficient  conditions  for  the 
almost  sure  termination  of  rule  (2.1). 


Lemma  1.  for  the  6*-slippage  Ll-identification  problem,  we  suppose 
that  the  sampling  rule  used  is  such  that: 


(2.2)  P[n^(N)  -*■  ® as  N •+•  *o  for  each  1 < i < k]  » 1 


Then  Q ■*  1 as  N -*■  ® almost  surely.  It  follows  that  if  the  stopping 
rule  (2.1)  is  used,  termination  will  occur  with  probability  one. 


Proof.  We  first  show  that  X.  -►  0.  a.s.  for  l<i<k  as  N -*■  ® . 
l.  i — — 

2, 


The 


sequence  {X..;  j = 1,2,...}  is  i.i.d.  N(e^,o  ).  Hence,  by  the 

v 

strong  law  of  l^rge  numbers , J X^./v  ^ a.s.  as  v -*-  <*>  • The 


sequence  { 


r"'  :=-l 

.1  v = 1»2>*** 


j=i 


} by  "stuttering",  i.e.  repeating 


each  value  several  times  before  going  on  to  the  next.  Hence  X.  6. 
almost  surely. 

Now,  without  loss  of  generality  assume  = t^-j  (1  <_  i <_  k). 


Then 


\ --  - x'1  j,  ”A.>  - V - 


Hence 


k-1 


k-1 


^ *k**'*'-' 58/2  - < l -i> 


which  by. -.the  argument  above  converges  almost  surely  to  some  strictly 
positive  limit  as  N ®.  Thus  we  must  have  -*■  + ® a.s.  N -*•  ®. 
Similarly  for  i / k we  can  write 


* rt  )zi 


h.  - <"A/«  - -i» 


k-1  k-1 

( l nA  'l  ni)  - 6*/2 


1=1 


i*i 


! 

1 
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and  the  lim  sup  of  the  RHS  is  s *rictly  negative  almost  surely  as  N -*■  «. 
Thus  lim  sup  = -»  a.s.,  and  '1  - Q)/Q  ■*>  0 a. 3.  which  prov  » the 
lemma. 


Remark  2.  If  the  sampling  rule  is  suih  that  condition  (2.2)  is  r £ 

satisfied  then  Q may  not  converge  to  unity.  For  example,  suppose  that, 

with  non-zero  probability,  n^(N)  n.  < « for  some  i(l  i <_  k - 1) 

and  nk(N)  -►  » as  H + •.  Then  Zk  ~ ^ l ^(5^  - X - 6*/2)/N 

-n.  ][  n.  (X.  - X.  - 6*/2)/N  remains  finite  and  Q 1. 

1 J 3* 


In  this  section  we  describe  several  sampling  (allocation)  rules 
to  be  used  in  conjunction  with  the  stopping  rule  and  terminal  decision 
rule  given  by  (2.1).  For  convenience,  we  assume  that  all  rules  initially 
take  one  observation  from  each  population. 

(i)  VT  At  each  stage  a block  of  k observations  is  taken,  one  from 
each  population.  In  the  medical  context,  within  each  block,  patients 
are  assigned  at  random  to  each  of  the  k treatments.  A closely  related 
rule  is  the  one  in  which  observations  are  taken  singly,  one  observation 
from  each  population  in  turn.  The  ASN  of  this  rule  must  be  smaller  than 
that  of  the  VT  rule  yet  our  experience  suggests  that  the  difference 

is  small.  Extensive  tables  for  VT  sampling  are  available  (BKS,  Chap.  18), 
and  those  results  provide  a yardstick  against  which  our  adaptive 
sampling  results  can  be  compared. 

(ii)  RAND  Q This  is  a randomized  allocation  rule  in  which  the  next 

observation  is  taken  from  with  probability  (1  <_  i <_  k).  Recall 

that  for  the  LI  identification  problem  can  be  considered  the 

posterior  probability  that  is  "best".  (Note  that  the  nonrandomized 

rule  that  always  samples  from  the  population  associated  with  max  Q.  may 

i 1 

not  terminate  by  Remark  2.)  For  k = 2 and  Bernoulli  observations,  Simon, 
Weiss  and  Hoel  (1975)  have  investigated  this  rule. 

(iii)  GRS  Robbins  and  Siegmund  (1974  Sect.  3)  proposed  a non-randomized 
rule  for  k ■ 2.  Here  we  describe  one  particular  generalization  to  k ^ 2. 
From  the  stopping  rule  (2.1),  we  note  that  sampling  has  certainly  terminated 
if 
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i.e.  if  z[k]  - Z[k_i;]  > b 

where  b = a2  log[(k  - 1)P*/(1  - P*)]/6*.  The  GRS  rule  depends  on  a 
parameter  c > b and  is  as  follows:  If 


Z[k]  ' 2[k-l]  > n(k)  ~ n(k-l) 


take  the  next  observation  from  the  population  associated  with  z[jc]> 
otherwise  take  a vector  of  observations  from  the  remaining  k - 1 
populations.  Here  n^  denotes  the  sample  size  associated  with  zq]* 

For  k = 2 this  reduces  to  the  rule  of  Robbins  and  Siegmund.  For  our 
simulations,  two  values  of  c were  chosen,  namely  c/b  = 1 and  c/b  = 1.2; 
these  were  the  values  used  by  Robbins  and  Siegmund. 

(iv)  Bessler  Bessler  ( 1960  , Section  8)  treated  this  problem  with 
k = 3,  but  from  a decision  theoretic  viewpoint  in  which  there  are  costs 
assigned  to  incorrect  terminal  decisions  and  there  is  a cost  c per 
observation  — there  is  no  P*  requirement.  He  obtains  asymptotic 
properties  as  c -►  0 for  his  procedure.  Because  Bessler' s stopping  rule 
does  not  guarantee  error  probability  requirements  we  continue  to  use  the 
stopping  rule  (2.1)  and  consider  only  Bessler' s sampling  rule.  Let 

<_  £ x|-3-]  denote  the  ranked  values  of  the  {x^  , 1 <_  i £ 3). 

. . . . 2 

Define  C(x)  = [(x^-j  - x^  ^x[3]  “ x[2]  ^ ’ Bessler's  allocation 

rule  is  randomized  and  prescribes  that  the  next  observation  be  taken 
from  the  population  associated  with  x^^  with  probability  X^  (i=l,2,3), 
where  X = (X  ,X  ,X  ) maximizes  X_X0/(X0  + X»)  subject  to  the 
constraints  X^  + + X^  = 1 and 


the  quantity  C(^c)  is  estimating,  namely  C(t).  But  when  t = (0,0,6*), 

C(t)  = 1 and  it  is  easy  to  show  that  = ^2  = + *3  = ^7(2  + /2). 

Sampling  rule  (v)  is  therefore  to  sample  from  the  population  associated 

with  max  Q.  (i.e.  with  zr,-.)  with  probability  72/(2  + 72),  and  the 
l<i<3  1 _x 

other  two  each  with  probability  (2+72)  . These  are  the  same  limiting 

proportions  as  those  obtained  by  Bechhofer  (1969)  in  the  related  fixed 
sample  size  multiple  comparisons  problem  for  two  experimental  treatments 
and  a control.  When  k > 3,  the  analogous  rule  would  be  to  sample  from 
the  population  associated  with  with  probability  (1  + A.  -1)  1 

and  the  remaining  populations  with  equal  probability. 

If  should  be  noted  that  rule  (iii)  (GRS)  is  the  only  one  that 
does  not  involve  any  randomization.  From  the  remarks  of  Byar  et  al.  (1976) 
and  Bailar  (1976),  randomization  is  important  in  clinical  trials. 


i 
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4.  Paulson's  procedure 


Paulson  (1964),  considered  a procedure  based  on  VT  sampling  but 

featuring  permanent  elimination  of  non-contending  populations.  At  the 

n'th  stage  of  experimentation  let  In  be  the  set  of  populations  not  yet 

eliminated  (1^  consists  of  all  k populations).  An  observation  is 

taken  from  each  population  in  I and  the  cumulative  sample  sums 

n 

*Sin:  i e *n*  are  comPuted  where  Sin  = £ x^.  If 

k=l 


S. 


max 


S.  - (a  - nA)  + , then 


in  jel  3n 
n 

IK  is  eliminated.  Here  a > 0 and  0 H < 6*  are  specified  constants 

and  y+  = max(y,o).  The  procedure  continues  until  only  one  population 
remains  which  is  selected  as  best.  A desirable  property  is  that  when 
A > 0 the  number  of  stages  is  bounded  by  [a/A]  + 1.  Here  [y]  denotes 
the  integer  part,  of  y*.  _ Note  that  the  stopping  rule  is  different 
from  (2.1). 


Paulson  proved  that  if  a = log 

6 ‘-A 


k-1 

* 

1-P 


then  PCS  > P* 


whenever  t i <5*.  Paulson  originally  suggested  the  choice 
of  A = 6*/4,  whereas  Fabian  (1974)  recommended  A = 6*/ 2 since 
asymptotically  (P*  -*■  1)  this  choice  minimizes  the  maximum  ASN  over  all 
parameter  configurations.  The  unbounded  procedure  when  A = 0 is  also 
of  interest.  In  fact,  Paulson's  procedure  is  conservative  in  that  the 

PCS  guaranteed  is  considerably  greater  than  P*  even  in  the  LF 
6*-slippage  configuration.  Fabian  (1974)  proposed  a modification  of 
Paulson's  procedure  whereby  this  "overprotection"  is  reduced  and 
consequently  so  is  the  ASN.  This  modification  consists  of  replacing 
"1  - P*"  in  the  formula  for  "a"  by  a quantity  0 > 1 - P*.  For 


A = 0,  e = (1  - P*)/[l  - (1  - P*)/(k  - 1)];  for  x = 6*/4,  B 
is  the  solution  to  the  equation 


14 


1 - P*  = (SCI  - |(B/(k  - 1))1/3]; 

finally  for  A = 6*/2,  B = 2(1  - P*). 

A more  dr  led  comparison  via  simulation  of  the  BKS  (with  VI.  sampling) 
and  Paulson  p'.or*-  cures  with  regard  to  PCS.  and  ASN  for  a wide  range  of  P* 
values  can  be  found  in  unpublished  papers  of  Ramberg  (1966),  Bechhofer  and 
Ramberg  (1977).  The  asymptotic  relative  efficiency  of  the  2 procedures 
has  been  studied  by  Perng  (1969).  Paulson's  procedure  has  been 
extended  by  Hoel  and  Mazumdar  (1968)  and  Hoel  (1971). 
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5.  Simulation  results  for  the  LI  identification  problem 

A Monte  Carlo  study  was  performed  in  order  to  compare  performances 

with  regard  to  ASN,  ITN  and  PCS  of  the  procedures  described  in  Sections 

2 

3,4  for  the  g*-slippage  problem.  The  values  P*  = 0.9,  g*  = 0.2,  a = 1 

were  chosen  along  with  two  values  of  k,  namely  k = 3,  10.  The 

results  are  displayed  in  Tables  I, II.  These  values  were  chosen  so  that  our 

results  would  be  directly  comparable  with  those  tables  in  Chap.  18  of 
BKS.  Ten  procedures  were  considered  namely  (A)  FIXED  sample  size 
(Bechhofer,  1954);  (B)  VT;  (C)  RAND  Q;  (D)  GRS  with  c/b  = 1.0; 

(E)  GRS  with  c/b  = 1.2;  (F)  the  BESSLER  sampling  rule  with  stopping 
rule  (2.1);  (G)  the  ^kH:l:l  allocation  rule;  (H),  (I),  (J)  Paulson's 
procedure  with  the  Fabian  modification  with  1/6*  = .5,  .25,  0 respectively. 
These  are  abbreviated  PF(.5),  PF(.25),  PF(0). 

In  the  tables  that  follow,  the  FIXED  sample  sizes  of  the  non- 
sequential single  stage  procedure  were  taken  from  Table  I of  Bechhofer 
(1954).  For  k = 10,  simulations  were  only  performed  for  procedures 
(B),  (C),  (E),  (I).  This  was  for  reasons  of  economy  and  because  these 
seemed  the  most  premising.  In  any  event,  Bessler's  rule  is  not  easy  to 
obtain  explicitly  for  k = 10. 

The  tables  display  {N^,  1 <_  i £ k}  where  N^  is  the  mean  number 
of  observations  taken  from  the  population  associated  with  T[jj*  Also 
displayed  is  the  average  total  sample  size  ASN.  The  three  entries 
in  each  cell  are  the  estimated  mean,  its  standard  error  (in  parentheses), 
and  the  estimated  standard  deviation.  Also  tabulated  are  the  observed 
proportion  of  correct  selections  and  the  mean  value  of  Q upon 
termination.  The  latter  yields  a more  precise  unbiased  estimate  of  the 
PCS  for  the  6*-slippage  configuration  — see  BKS  page  289.  Q also  gives 


Table  II  Simulation  results 
10,  P*  = 0.9,  6*  = 0.2,  o 


0.912  0.907  0.909  0.941 
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a measure  of  "excess"  or  "overshoot". 

It  can  be  seen  from  Tables  I, II  that  the  adaptive  rules  possess 
some  distinct  advantages  over  VT.  For  k = 3,  the  decrease  in  ASN 
is  about  6%  for  k = 3 (Table  I)  and  the  savings  increase  to  about 
50%  for  k = 10  (Table  II).  Also  there  is  the  side  benefit  in  that 
the  ITN  can  be  reduced  by  about  22%  for  k = 3 and  the  savings  increases 
to  60%  for  k = 10.  Most  of  the  adaptive  rules  (C)  - (G)  perform  about 
the  same  — RAND  Q seems  to  do  slightly  better  with  regard  to  ASN,  GRS 
slightly  better  for  ITN.  Among  the  PF  rules,  the  results  here  are 

consistent  with  Paulson's  choice  of  X = 6*/4.  Of  course,  the  simulations 
are  only  for  P*  = 0.9,  6* /o  = 0.2,  but  we  believe  that  the  results 

are  indicative  of  what  happens  for  other  (P*,  6* /a)  combinations. 
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6.  The  ranking  problem. 

We  now  attempt  the  transition  from  the  LI  identification  to  the 
ranking  problem  where  we  assume  that  the  ordered  values  are  completely 

unknown.  In  the  indifference  zone  formulation  of  the  normal  means  ranking 
problem,  we  define  the  preference  zone  PZ  = { r | t ~ ^ >_  6*}  = 

{e|0[k]  ~ 'L  6*)  and  seek  procedures  that  guarantee  the  requirement 
that  PCS  > P*  wherever  t e PZ.  BKS  (Theorem  6.1.1)  prove  that  rule 
(2.1)  with  VT  sampling  satisfies  this  requirement.  Paulson  (1964)  showed 
that  his  procedure  also  meets  the  requirement.  (So  does  of  course  the 
fixed  sample  size  rule  of  Bechhofer  (1954).)  All  three  procedures 
utilize  the  fact  that,  for  a given  set  of  procedure  parameters  (P*,S*,etc. ) , 
the  infimum  of  the  PCS  over  all  9 in  PZ  is  achieved  in  the  6*-slippage 
configuration,  which  is  therefore  termed  "the  least-favorable"  (LF) 
configuration.  Thus  for  those  procedures  the  ranking  problem  can  be 
reduced  to  an  LI  identification  problem.  It  might  be  reasonably  conjectured 
that  any  procedure  using  stopping  rule  (2.1)  and  a symmetric  sampling  rule 
that  assures  temmination  w.p.  1 would  also  guarantee  PCS  >_  P*  whenever 
^ e PZ.  However  our  simulations  (see  Table  VII)  demonstrate  this  very 
intuitive  conjecture  to  be  false.  In  general  there  are  configurations  in 
PZ  less  favorable  than  the  6*-slippage  configuration.  This  was  a rather 
surprising  result  - at  least  to  the  authors.  A heuristic  explanation  for 
this  phenomenon  is  given  in  the  Appendix. 

Monte  Carlo  simulations  were  carried  out  to  investigate  the 
performance  of  the  ten  procedures  of  Tables  I, I I when  presented  with 
parameter  configurations  other  than  6* -slippage.  For  k = 3,  four 
alternate  configurations  were  chosen , for  k = 10  one  alternate 
configuration.  These  were  chosen  so  that  easy  comparison  could  be  made 


with  Tables  18.5  of  BKS.  Again  P*  = 0.9,  6*  = 0.2,  and  o = 1.0. 

The  results  are  presented  in  Tables  III-VII,  The  format  is  the  same  as 
that  used  earlier  in  Tables  I,  II. 

The  FIXED,  VT  and  PF  rules  do,  of  course,  guarantee  the  probability 
requirement  for  the  ranking  problem.  It  is  well  known  that,  under 
certain  parameter  configurations,  the  sequential  procedures  lead  to 
considerably  lower  ASN's  than  does  the  single-stage  procedure.  As 
between  VT  and  PF,  overall  PF  has  of  course  a much  lower  ITN,  and  it  appears 
to  have  a comparable  or  lower  ASN.  Also  the  overprotection  (PCS  actually 
attained)  seems  to  be  greater  for  the  Paulson  procedure  even  with  the 
Fabian  modification.  Bechhofer  and  Ramberg  (1977)  compare  the  VT  and  PF 
procedures  in  much  greater  detail. 

One  desirable  feature  of  any  ranking  procedure  is  that  the  ASN 
should  not  be  greatly  increased  by  the  introduction  of  extra  "non -con tending" 
populations  with  means  6 < < 0.  The  Paulson  procedure  does  not  have  this 
feature  since  the  continuation  region  is  enlarged  as  k increases  and  it 
takes  longer  for  the  contending  populations  to  be  eliminated.  On  the 
other  hand,  the  number  of  observations  on  the  "contending"  populations 
is  fairly  insensitive  for  procedures  involving  the  BKS  stopping  rule. 

Concerning  the  almost  sure  termination  of  the  adaptive  rules.  Lemma 
1 of  Section  2 holds  whenever  > 6*/2.  The  proof  is 

unchanged.  The  simulation  results  from  Table  III  suggest  that  this  result 
should  hold  even  if  the  restriction  is  removed.  As  with  any  of  the 
procedures,  if  one  is  concerned  about  a large  variance  of  the  sample  size, 
bounded  procedures  can  be  constructed  which  still  enjoy  some  of  the 
benefits  of  adaptive  sequential  sampling.  To  do  this,  define 
P'  + P"  = 1 + P*,  use  the  adaptive  sequential  procedure  but  with  P' 


Table  III  Simulation  results 


PF(0) 

121.59 

(3.0) 

94.93 

114.51 

(3.0) 

93.97 

117.98 

(3.1) 

98.53 

354.09 

(7.4) 

232.93 

cm  i 

CD 

CO 

in 

CM  CD 

CM  ^ It 

CM  L*  i 

to  CD 

CM 

co  in  oo 

CD  U.  It 

O zt 

M»  O 

00 

• 

• • 

• * • 

• • • 

• • • 

r- 

ro  — i z 

it  H H 

to  rH  CD 

zt  CO  o 

co 

u. 

O ... 

o w in 

O w zt 

•H  s*-/  CM 

• 

cu 

*H 

H 

CO  H 

o 

PF  (.5) 

1 

93.23 

(1.2) 

38.97 

98.16 

(1.3) 

40.02 

98.49 

(1.2) 

38.60 

294.61 

(2.9) 

91.23 

0.673 

CO  r-%  CD 

O ^ to 

CO  r— s CM 

to  ^ in 

1 

rH 

O CO  h 

O 00  rH 

h-  CO  CO 

CO  CM  CO 

CO 

• • 

• • • 

• • • 

• • • 

• • • 

o 

rH 

CD  CM 

CD  CM  CO 

CO  CM  Is 

to  CO  CD 

CT> 

• • 

rH  W co 

rH  ^ CO 

rH  w CO 

in  w in 

• 

rH 

• — 1 

rH 

CO  CM 

o 

Zt  ^ U"> 

r-  3- 


H fO  (D 
(M  vO 


CO 

/^V 

LO 

It 

CM 

CO 

in 

zt 

CO 

CM 

CO 

< — 1 

1 — 1 

Zt  CO 

r-  cm  to 


to  co  o 
•H  W O 


CD  ^ H 
£■".  (Jl  rH 


CD  CO  CM 
*H  w CM 


CO  ^ O 
tO  it  CO 


to  CO 
rH  w O 


o 

CD 

CD 

to 

CO 

CO 

CD 

zt 

to 

o 

CO 

CO 

m 

CD 

rH 

to 

«H 

in 

CD 

in 

s-r 

CO 

co 

CM 

CO 

to 

CO 

<D 

r** 

O 

CO 

co 

CO 

Vr< 

CM 

»H 

rH 

00  ^ tO 
in  in  H 


CO  CO  o 

CM  w rH 


tO  LO 
CM  LO  CD 

in  co  h 

CM  w rH 

rH  rH 


CO  ^ CO 
O it  to 

cd  co  m 

' to 

CO  CM 


o 

in  CM  CD 


in  cm  o 

CO  CO  rH 


o 

CD 

rH 

o 

in 

O 

CO 

in 

CO 

CD 

rH 

t>  ^ to 
to  cm  co 


co  co  o 

rH  W o 


o 

/—* s 

CD 

»H 

o 

in 

O 

CO 

m 

CO 

'w- 

CD 

*H 

o 

CD 

rH 

o 

in 

O 

CO 

m 

CO 

CD 

rH 

CM 

/'“'s 

CD 

CD 

CD 

00 

r- 

CO 

zt 

CO 

CM 

CD 

/-N 

CM 

rH 

O 

CD 

to 

CD 

'w' 

CO 

CO 

CM 

Table  IV  Simulation  results 


in  /-s  co 

CM  CD  CM 


St  ID 
ID  00  CM 


CO  CD 

m ^ cm 

CO  00  o 

CM  rH  »H 

• • • 

• • • 

cm  o in 

lo  cm  m 

3-  CN 

CO  ' tD 

•H 

/*“N 

in 

•H 

00 

in 

• 

• 

• 

in 

o 

in 

o 

CD 

ID 

st 

• 

• • 

o 

O CM 

ID 

w CM 

00 

CO 

in 

CO  rH  1 

• 

• • 

o in 

, st 

w CM 

CO 

rf'—s 

CO 

3- 

r- 

CO 

• 

• 

• 

CD 

o 

CO 

St 

V—' 

CM 

CD 

/-N 

m 

i 

C^ 

c- 

• 

• 

• 

CO  ^ O 
CO  00  O 


lO  o ^ 
st  w CM 


co  r- 
(N  O H 


CD 

o 

* 

• 

• 

• 

C- 

CM 

CO 

st 

ID 

CD  r’-*  CD 

O ^ 00 

C'*  CD 

CM  CM 

st  c***  in 

CM  rH  CO 

• • • 

• • • 

• • • 

00  O CM 

00  O CM 

st  cm 

CO  W CM 

CO  W CM 

CM  w CD 
•H 

CM  ^ O 
CM  CM 


a»  ^ H 

CO  CD  rH 


id  o m 

^ V(N 


CM 

CD 

in 

00 

m 

00 

o 

st 

CM 

>-✓ 

CM 

CO 

00 

o 

00 

o 

CD 

• 

• 

o 

o 

s_^ 

/*N 

CM 

CD 

00 

CD 

• 

• 

• 

St 

rH 

CD 

CD 

in 

•H 

CO 

CM 

o 

CD 

o 

CD 

• 

• 

o 

O 

»H  ^ in 

in  cd  h* 

CD 

• • • 

00 

in  cm  o 

CD 

CM  W 00 

rH 

o 

/-s 

CO  H co 


ID  CM  CD 
CO  CD 


st 
st  o 
00  o 

<D  • 

• O 

O W lo 


in  r-N  CO 

CD  CM  in 

r-s 

St 

CD  O 

00 

• • • 

CO  O 

O 

00  CM  00 

CD  • 

CD 

CM  W CD 

• O 

• 

*H 

O w 

o 

CD  H 

CD  ^ H 

ID  ^ CO 

Kn 

st 

00  CD  fH 

CO  CD  H 

id  co 

St  O 

st 

• • • 

• • • 

• • • 

oo  o 

•H 

id  o in 

id  o m 

o cm  in 

CD  • 

CD 

St  CM 

St  w*  CM 

St  w 

1 »“* 

• O 

O w 

O 

c v 
o o w 

•HOC 

v £ o 

6 «H 
O O 4-> 
O.  O 3 
O <H 

fc.  O 

£ • v) 


Table  V Simulation  results 


Table  VI  Simulation  results 


I 13 

Q) 

r-i  U) 

CN  fO 


CM  ^ rH 

00  ' rH 

H CD  00 

m to  cn 

• • • 

• • • 

to  co  cn 

co  co  o 

r~  w j- 

r-  w in 

CM 

CO 

co 

cn 

in 

• 

• 

• 

CM 

CM 

rH 

tO  CD 

r-*  cn  zt 


CM  CM  O 
t*'  ^ (O 


id  in 
•H  CD 


co  d*  in 
co 


CN  r-'  C"* 
CD  CM  CO 


(J)  /■“*'  o 
cn  co 


o co  CD 
cn 


CO  «*-* 

H CM  h 


ct  zr  cn 

O w in 


CO  ^ to 
^ O CO 


CM  CO  CM 

J V i 


CM  ^ 00 
00  CO  00 


CM  r-*  00 
00  CO  CO 


CO  H CM  CO  H CN 

to  >»✓  co  in  v-/  co 


oo  ^ cn 
cn  m co 


co  ^ co 
co  co  oo 


O)  H 00 
lO  V H 


cn  ^ co 
H CM  H 


H CN  H 
h v CO 


CO  ^ CO 
CT>  00  rH 


O*  (N  O 
M vj- 


rl  ^ 00 
CO  ^ H 


oo  H cn 
CM  W H 


187.63 

(7.7) 

108.36 

0.975 

(0.011) 

i 

181.99 

(6.3) 

89.47 

0.950 

(0.015) 

188.56 

(4.9) 

69.30 

0.970 

(0.012) 

I 

rH  ^ 00 
O O CO 

rH  cn  CO  1 
00  W CM 
rH  Hj 

1 

0.885 

(0.023) 

181.02 

(9.9) 

139.97 

0.925 

(0.019) 

159.43 

(5.9) 

83.19 

0.930 

(0.018) 

Zt  ^ r-\' 

CM  O*  to  *H 

• • *i  co  o 

o co  m,  cn  • 

CO  w &> ! • o 

rH  O W 


CM 

/-V 

00 

00 

CD 

00 

• 

• 

• 

CD 

*H 

CN 

in 

w CO 

ID  ^ I to  Zt 

4-  CO  CO'  CM  rH 

• • H O 

o d-  Oj  cn  • 

h v H • O 

rH  rH  O w 


CO  rH  ! rH  ^ 00:  ' 

o cm  in  cooo,  to  rH 

• • • I • • • ; co  o 

co  H co  cm  co  ml  cn  • ' 

CM  w H I w 00,  . o J 

•H  O w 


1 1 

I 

C 

m1 

I 

o o 

c 

! 

j 

•H  V 

o 

V E 

•H 

i o 1 

E 

CN  rH  - E- 

o o 

o 

<y 

Z Z ! Z 

(X  u 

0): 

i 

o 

*H 

1 

ft  Mh 

0) 

£ o 

CO 

912  .907  .906  .906  .906  .909  .947  .936  .933 


Table  VII  Simulation  results 


Frequency  population 
selected  out  of  200 


2 

raplacing  P*,  but  take  no  mere  observations  than  prescribed  by  the 
fixed  sample  size  rule  (Bechhofer  1954)  for  PCS  = P".  This  idea  was 
suggested  by  BKS  (p.  227). 

An  advantage  of  the  VT  and  PF  rules  not  enjoyed  by  the  adaptive 
rules  is  that  there  is  a "time-blocking"  effect.  This  might  be  important 
if,  for  instance,  overall  quality  of  treatment  improves  with  time. 


7.  Directions  of  future  renenrch. 


It  is  disappointing  that,  in  the  ranking  problem,  the  BKS  stopping 
rule  (2.1)  does  not  guarantee  the  requirement  that  PCS  >_  P*  whenever 
0 j-^-j  - Gj-k  >_6  * independently  of  the  sampling  rule  when  k :>  3.  Thus 
the  adaptive  procedures  of  Section  3 cannot  be  regarded  as  a satisfactory 
solution  to  the  general  ranking  problem.  It  is  possible  that  the 
6-slippage  configuration  is  least  favorable  within  a restricted  class  of 
configurations  such  as  perhaps  the  6 -slippage  (GLF)  configurations  where 
6 >_6*/2  is  unknown.  Alternatively  it  may  be  possible  to  construct 
sampling  rules  (other  than  VT)  for  which  the  6*-slippage  configuration 
is  least  favorable.  Further  investigation  is  needed  into  these  questions. 

In  the  past,  there  seems  to  have  been  little  work  done  on  adaptive 
sampling  for  identification  or  ranking  problems  with  k >_  3.  Many  of 
the  arguments  used  by  authors  for  k = 2 do  not  seem  to  carry  over 
easily.  For  Bernoulli  sbservations  on  k >_  3 populations,  Sobel  and 
Weiss  (1972)  have  discussed  "Play- the- Winner"  sampling  and  inverse 
stopping  rules  — see  also  Berry  and  Young  (1977). 

For  the  LI  identification  problem  the  potential  savings  in  ITN  and 
ASN  might  seem  to  justify  the  more  complex  allocation  rules  of  the 
adaptive  procedures.  Other  procedures  that  might  also  be  of  interest, 
but  were  not  compared  here,  are  those  of  Box  and  Hill  (1967)  and  Blot 
and  Meeter  (1973).  No  attempt  here  was  made  to  find  optimal  sampling  rules 
for  the  LI  identification  problem  — it  is  possible  that  the  approach  of 
Kiefer  and  Sacks  (1963)  might  lead  to  optimal  fully  sequential  procedures . 
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APPENDIX 

Here  we  give  heuristic  arguments  concerning  some  sufficient 
conditions  in  the  ranking  problem  for  the  non-consistency  of  the 
procedure  (2.1)  when  adaptive  sampling  rules  are  employed.  This 
gives  insight  into  why  the  6*-slippage  configuration  is  not  least 
favorable  and  it  is  possible  for  PCS  < P*  for  some  configurations  in 
the  preference  zone. 

Recall  z.  = n (X.  - X - (N  - n.)6*/2).  W.l  .o.g.  assume 

1#  ( ( 1 

— w2  — * * ' — wk*  SuPP°se  also  N » -0  with  n^  » 0 

1 < j < n.  Also  let  n.  = max  (n.)  and  n.  » n,  . For  the  adaptive 

l<j<k  : 1 k 

sampling  rules  discussed, this  situation  can  occur  with  positive  probability 

due  to  an  "unlucky  start."  Suppose  n,.  = C-N  and  on  some  set  R’ 

of  positive  probability  C tends  to  a limit  a where  a is  bounded 

away  from  0 and  1.  ( For*  araiiTnoivf  *Hia+-  i i * 

need  is  that  lim  inf  C > 0 and  lim  sup  C < 1 on  some  set  R ' 

N-x» 

of  non-zero  probability.  There  is  also  a condition  analogous  to  (Al) 
if  o = 1 on  some  fl'  of  non-zero  probability.) 

Then  by  use  of  the  law  of  the  iterated  logarithm  it  is  possible  to 
show  that  oh  R ' 

(AD  z.  - z*  ct(l  - a)N(u  - S*/2)  - a J n.p.+o(N).  If  the 

configuration  {p . } is  such  that  z.  - z.  +®  as  N •♦•  <*>  then 

Q 1»  procedure  will  stop  but  IK  will  be  incorrectly  selected 

as  best.  From  (Al),  this  happens  if 

k 

(A2)  Np  . - (1-  a)N6*/2  - £ n.p.  ■+■  + <*>. 

j=l  3 ^ 

For  the  6 -slippage  (GLF)  configuration  this  cannot  happen  since 
the  expression  (A2)  tends  to  - « and  eventually  z^  > z^.  (Of  course. 


for  6 = 5*  (the  LI  identification  problem)  we  already  have  the 
consistency  of  the  procedure  (2.1)  for  any  symmetric  sampling  rule 
that  terminates  almost  surely. ) 
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However,  for  a general  configuration  of  the  (Uj),  it  is  quite 
possible  for  condition  (A2)  to  hold  (e.g.  for  the  equally-spaced 
configuration  with  p.+1  - y..  = 6,  1 <_  j <_  k - 1). 

The  question  arises  concerning  the  kind  of  sampling  rules  for  which 
this  phenomenon  can  occur,  i.e.,  roughly  speaking,  when  we  can  have 
n.  > > n,  for  i i k and  n,  > >0.  Unfortunately  it  occurs 
for  rules  like  RAND  Q and  GRS,  which  have  the  feature  of  being  more 
likely  to  sample  next  from  the  population  with  the  highest  current 
value  of  z_.  (with  the  aim  of  reducing  ITN).  For  these  rules,  due 
to  an  unlucky  start  it  is  quite  possible  for  z^  > after  some 
stage  of  sampling.  For  such  rules  this  is  likely  to  Imply  that  n^  > 

Then  is  more  likely  to  be  sampled  on  next  and,  if  (A2)  is  satisfied,  the 

difference  z^  - z^  is  likely  to  be  increased,  thus  perpetuating  and 
worsening  the  condition. 
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y y V/  C>  * ■ C*  *»  £>.i  <*iiV  ll  r~wt  legality  by  MocJ*  r>:i.nl»»i) 

Adaptive  .sampling;  sequential'  analysis;  ranking  and  selection;  elimination  , 
procedures;  identification  problems;  slippage  problems;  Paulson  procedure; 

_ da 1 1 dependent  alloc.nt ion;  normal  distribution. 

• i Mx1' jf  n ACT  fCtiOlhtuti  cn  r tv  rt  * -*  *»£»«•  It  n*  : * ,*r>  \:y  m:f  U.'tat^ly  .'•/  !*!<•  ck  nu,r*'»^ 

There  are  k (>_  2)  comp  a ling  normal  populations  with  common  known 

variance  and  unknown  means  °i  ’"2’  ‘ ' ' ,(V  * ^et  ®p  j ^ •••  < -|  denote 

•tiv  ordered  values  of  the  (0 . } . Nothing  .is  known  concerning  the  pairing  of 

the  to.]  and  the  { 0 _ ...  1 . In  the  location  invariant  i.dunti  final*  Lon  problem , 
a l.U 

Hi  : d i Florences  {0r...  ..  0 . } are  given  and  it  is  desired  to  select  the 


l i»  i«..!  

• *V.H  I*  t 1,*.^  (»l  l 1 1 1 % I A •.  I /», 


population  associated  with  0 1 ^.-j - Of  particular  interest  is  the  slippage 


con  f igura t ion  0 


u:  i 


°Lk-l]  = °tk]  " where  6*  > 0 is  given. 


We 


restrict  attention  to  procedures  that  guarantee  that  the  population  with  the 
lai'gost  mean  is  correctly  selected  with  probability  at  least  P*  where 
k < P-  < 1 is  preassigned.  Essentially,  this  requirement  is  satisfied  by 
“the  stopping  rule  of  Bechhofcr,  Kiefer  and  Sobel  (Sequential  Identification 
and  P.-anking  Procedures , Univ.  of  Chicago  Press  1968,  Chap.  3)  independently 
of  the  sampling  rule  and  thus  data  dependent  allocation  rules  can  be  considered. 
Unlike  the  case  k = 2 (see  Robbins  and  Siegmund  (1974)  J.  Ain,  Statist.  Assoc. , 
69,  132-139),.  when  k ^ 3,  substantial  savings  in  expected  total  sample 
size  can  be  obtained  when  adaptive  sampling  is  used  instead  of  the  equal- 
allocation  rule  ("vector  at  a time"  sampling).  Several  procedures  are  proposed 
and  investigated  with  respect  to  this  criterion.  FAlso  their  performance 
with  regard  to  the  alternative  criterion  of  minimizing  expected. number  of 
observations  on  the  inferior  populations  is  studied.  ’Both  theoretical  and  ' 
simulation  results  are  presented.  Comparisons  are  made  with  the  performance 
of  el  im  ination- type-  procedures , such  as  that  of  Paulson  (1964),  Ann.  Math. 
Statist. , 35,  174-180.  - 

Finally,  for  the  ranking  problem  where  the  differences  0 - O', 

• LiJ  LIJ 

(2  < i <_k)  are  now  unknown , it  is  shown  that,  when  adaptive  sampling 
rules  are  used,  the  slippage  configuration  is  no  longer  necessarily  least 
favorable  for  the  usual  indifference  zone  approach.  This  poses  some  interesting 
. problems  for  future  research. 


